LIG at MediaEval 2013 Affect Task: Use of a Generic Method and Joint Audio-Visual Words
نویسندگان
چکیده
This paper describes the LIG participation to the MediaEval 2013 Affect Task on violent scenes detection in Hollywood movies. We submitted four runs at the shot level for each subtasks: objective violent scenes detection and subjective violent scenes detection. Our four runs are: hierarchical fusion of descriptors and classifier combinations, the same with joint audio-visual words, and the same two with reranking. Our reference run obtained with the official MAP@100 metric a performance of 69% for the subjective violence and 52% for the objective violence. The joint audio-visual words bring a slight improvement on the MAP@100 and they improve the precision in the head of the returned list while the temporal re-ranking improves the P@100.
منابع مشابه
LIG at MediaEval 2011 affect task: use of a generic method
This paper describes the LIG participation to the MediaEval 2011 Affect Task on violent scenes’ detection in Hollywood movies. We submitted only the required run (shot classification run) with a minimal system using only the visual information. Color, texture and SIFT descriptors were extracted from key frames. The performance of our system was below the performance of the systems using both au...
متن کاملLIG at MediaEval 2012 affect task: use of a generic method
This paper describes the LIG participation to the MediaEval 2012 Affect Task on violent scenes’ detection in Hollywood movies. We submitted four runs at the shot level: hierarchical fusion of descriptors and classifier combinations (LIG-4), the same with conceptual feedback (LIG-3), and the same two with reranking (LIG-2 and LIG-1). Our reference run obtained a performance slightly above the me...
متن کاملNII-UIT at MediaEval 2013 Violent Scenes Detection Affect Task
We present a comprehensive evaluation of shot-based visual and audio features for MediaEval 2013 Violent Scenes Detection Affect Task. To obtain visual features, we use global features, local SIFT features and motion features. For audio features, the popular MFCC is employed. Besides that, we also evaluate the performance of mid-level features which is constructed using visual concepts. We comb...
متن کاملMediaEval 2011 Affect Task: Violent Scene Detection combining audio and visual Features with SVM
We propose an approach for violence analysis of movies in a multi-modal (visual and audio) manner with one-class and two-class support vector machine (SVM). We use the scale-invariant feature transform (SIFT) features with the Bag-of-Words (BoW) approach for visual content description of movies, where audio content description is performed with the mel-frequency cepstral coefficients (MFCCs) fe...
متن کاملThe TUM Approach to the MediaEval Music Emotion Task Using Generic Affective Audio Features
This paper describes the TUM approach for the MediaEval Emotion in Music task which consists of non-prototypical music retrieved from the web, annotated by crowdsourcing. We use Support Vector Machines and BLSTM recurrent neural networks for static and dynamic arousal and valence regression. A generic set of acoustic features is used that has been proven effective for affect prediction across m...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013